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Abstract 

This report considers the class of appHcations of sensor networks in which each sensor node makes 
measurements, such as temperature or humidity, at the precise location of the node. Such spot-sensing 
applications approximate the physical condition of the entire region of interest by the measurements 
made at only the points where the sensor nodes are located. Given a certain density of nodes in a 
region, a more spatially uniform distribution of the nodes leads to a better approximation of the physical 
condition of the region. This report considers the error in this approximation and seeks to improve the 
quality of representation of the physical condition of the points in the region in the data collected 
by the sensor network. We develop two essential metrics which together allow a rigorous quantitative 
assessment of the quality of representation achieved: the average representation error and the unevenness 
of representation error, the latter based on a well-accepted measure of inequality used in economics. 
We present the rationale behind the use of these metrics and derive relevant theoretical bounds on them 
in the common scenario of a planar region of arbitrary shape covered by a sensor network deployment. 
A simple new heuristic algorithm is presented for each node to determine if and when it should sense 
or sleep to conserve energy while also preserving the quality of representation. Simulation results show 
that it achieves a significant improvement in the quality of representation compared to other related 
distributed algorithms. Interestingly, our results also show that improved spatial uniformity has the 
welcome side-effect of a significant increase in the network lifetimeP] 
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I. Introduction 

Networks of inexpensive low-power sensor nodes may be deployed to sense, gather and 
process information in a region of interest for a variety of purposes including surveillance, target 
tracking, wildlife monitoring and pollution studies [1]. Based on the expected behavior of indi- 
vidual nodes, these applications of sensor networks may be broadly categorized into two types: 
area-sensing applications and spot-sensing applications. Examples of area-sensing applications 
include enemy surveillance, target tracking, intrusion detection and wildlife monitoring through 
audio/image/video recording; in these applications, sensor nodes make relevant observations 
within a local sensing area using vision, sound, seismic-acoustic energy, infrared energy, or 
magnetic field changes. On the other hand, in spot-sensing applications, each sensor node 
makes measurements of physical phenomena such as temperature, humidity and environmental 
pollution at precisely the spot where it is located, and there is no concept of a sensing area. The 
physical condition of each point in the region of interest is represented in the data collected from 
nearby active sensor nodes. The farther the nearest active nodes are from a point, the poorer 
is the representation of the physical condition at the point in the data collected by the sensor 
network. For example, if most of the active sensor nodes are clustered together in one corner 
of a region, the quality of representation of the region is likely to be poor. A more spatially 
uniform distribution, however, will lead to an improved quality of representation. In this report, 
we consider spot-sensing applications and introduce the problem of improving this quality of 
representation in the data collected by the sensor network. 

The problem of improving the quality of representation is related but different from the 
coverage problems typically considered for area-sensing applications [j2|-[[8|. In most of these 
coverage problems, a region is considered A;-covered if all points in it are within the sensing 
area of at least k active nodes. Such a notion of coverage, while appropriate for area-sensing 
applications, is not relevant for spot-sensing applications where there is no concept of a sensing 
area. In spot-sensing applications, the quality of representation enjoyed by a point in the region 
depends on the desired spatial granularity with which the physical condition needs to be sampled 
and on some function of the distances to the nearest set of active sensor nodes around the point. 
This report introduces new metrics that help evaluate the quality of representation achieved by 
a sensor network deployed in a region. 

Energy being a key constraint in most sensor networks, this work assumes that a sensor node 
can be programmed to make a choice at specific intervals of time on whether it should be in the 
sense mode (also referred in this report interchangeably as the active mode) or the sleep mode (in 
which its sensing module is turned off). An additional goal in spot-sensing applications becomes 
one of developing a distributed algorithm to determine sleep/sense times with the specific goals 
of (i) conserving energy, (ii) achieving the desired spatial granularity with which the physical 
condition in the region is sampled by achieving the appropriate spatial density of active nodes, 
and (iii) finally, achieving a high quality of representation of the region at all times by the 
network of active nodes. 

The metrics for the quality of representation and the above goals of a distributed algorithm 
are also relevant in the context of sensor networks with transducer heterogeneity. It is becoming 
increasingly common in real world sensor network applications to integrate data from several 
different types of transducers [|9|, [10|. Microsensors, especially those using microelectromechan- 



ical systems (MEMS), permit the sensing of a variety of physical phenomena on a single sensor 
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node [[TT]. Sensor nodes such as the Berkeley MICA Mote typically integrate several transducer 
types, such as for acceleration, temperature, light and sound, on a single board [ 12|. Each sensor 
node typically has dynamic control over which transducers are active. Since different physical 
phenomena generally require sensing at different spatial granularities, one can avoid unnecessary 
energy consumption by activating only a subset of transducers at each of the sensor nodes. This 
calls for distributed algorithms executed by all the sensor nodes to automate the process of 
determining which transducers should be activated on which nodes based on the desired density 
of each transducer type while also ensuring that all points in the region are well-represented by 
measurements made by each transducer type. 



A. Problem Statement 

Consider sensor nodes distributed within a certain region of interest, denoted by R. Let 
G' = (V, E') denote the graph of these sensor nodes where each node u &V' represents a sensor 
node and each edge (m, v) E E' represents the fact that nodes u and v are neighbors and can 
communicate directly with each other. Let d{u, v) denote the Euclidean distance between sensor 
nodes represented by vertices u and v. The Euclidean distances between nodes are computable if 
the nodes are all fitted with low-power GPS receivers, or through location estimation techniques 
if only a subset of nodes are equipped with GPS receivers [ |13[ , or by estimating distances based 
on exchanging transmission and reception powers [14J. 

Let z denote the desired spatial density, in number of active nodes per unit area, determined 
based on the spatial granularity with which the physical phenomenon of interest should be sensed. 
Let G = (V, E) denote the subgraph of G' such that v E V iff vertex v represents an active node 
(as opposed to one in sleep mode). In this report, we do not require that G be a connected graph 
(because, in many applications, retrieval of data from sensor nodes may be accomplished through 
mobile gateways p3|). The problem now is one of determining G in a distributed manner so 
that every point in the region is well-represented by the active sensor nodes. 

There are two key aspects to this problem: 

1) What are the metrics that one should use to measure the quality of representation achieved 
by G? 

2) Given the metrics for quality of representation, what is a distributed algorithm that one 
should employ to determine G (i.e., to determine which nodes should sense and which 
nodes should sleep) while achieving a high quality of representation? 



B. Contributions and Organization 

We propose a new problem described in the previous subsection specifically for spot-sensing 
applications in sensor networks. We develop a pair of metrics that together allow a quantitative 
assessment of the quality of representation: the average representation error of the points in 
the region and the unevenness of representation error across the points in the region. Section |ll] 
presents these metrics along with the rationale behind them. Based on the average representation 
error of the points in the region. Section II-A develops a metric normalized by the desired 
spatial density to allow for comparative evaluations of the quality of representation achieved 
by a network across different desired spatial densities. Section |II-B| borrows from the field of 
economics and uses the Gini index, a well-accepted measure of inequality, to develop a new 
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metric for the unevenness of representation error among the points in the region. Section |II-C 



discusses the need for both of these complementary metrics. Lower bounds on both metrics are 



derived in Section II-D Upper bounds on these lower bounds for the common scenario of a 
continuous two-dimensional region covered by a sensor network are derived in Appendix |Aj 
Section Jll] discusses work in sensor networks as well as in other fields which seek to solve 
similar or related underlying mathematical problems. 



Section IV develops a generalized, distributed algorithm, called EvenRep(J^, L), to achieve 
a better quality of representation for points in the region of interest. The algorithm, a simple 
heuristic, is parametrized by two quantities: J-", a target distance function which specifies the 
desired distance between an active node and its fc-th nearest active neighbor and L, the maximum 
number of active neighbors that a node should consider in making its decision to sense or sleep. 
The algorithm seeks to achieve the target distance function for all active nodes. The target distance 
function, J^, can depend on whether the region of interest is 2-dimensional or 3 -dimensional, the 



type of application, or any spatial constraints specific to the region. Section IV-A describes the 
pseudo-code of the algorithm and the rationale behind it. We find that the ideal target distance 
function is one based on the region of interest being tessellated by congruent hexagonal cells 
with an active sensor node at the center of each cell. We denote this target distance function by 



H and Section IV-C describes EvenRep(if, L), used in our simulation results. 

Section |V] presents several simulation results on the performance of EvenRep(i/, L) and some 
other representative algorithms. The results show that the EvenRep(/f, L) algorithm achieves a 
significant improvement in the quality of representation in comparison to other algorithms. We 
show that achieving an improved quality of representation has a welcome side-effect of signif- 
icantly improving the network lifetime. In fact, we show that EvenRep(if, L) achieves almost 
a 50% increase in the network lifetime in comparison to other related distributed algorithms. 



Section VI concludes the report. 



II. The Metrics 

In this section, we develop metrics to quantify the quality of representation achieved in a 
sensor network deployment for spot-sensing applications. Past research that discusses related 
metrics has largely assumed a system model that is more appropriate for coverage problems in 
area-sensing applications [|2|-[[8|. In these problems, each sensor node has a pre-defined sensing 
range and the goal is to ensure that each point in the region of interest is A;-covered, i.e., lies 
within the sensing area of at least k active sensor nodes. As opposed to coverage at a point in 
an area-sensing application, the quality of representation of a point in a spot-sensing application 
is not easily captured in an either-or binary manner, an implication of the fact that there is no 
concept of a sensing area in spot- sensing applications. Even modified coverage problems for 
area-sensing applications, such as when a point is considered either covered or uncovered with a 



probability that is a function of the distance to the nearest sensor node [16|, do end up imposing 
a binary either-or assessment that is not useful to assessing the quality of representation of the 
point. Also, metrics based on the distances between active nodes (e.g., [17]) used in solving 
different problems do not capture the quality of representation for spot-sensing applications, 
a quantity that is more about the points in the region of interest than the distances between 
neighboring active nodes. 
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The quality of representation of a point depends on the error in the representation of the point 
in the data collected by the sensor nodes. As mentioned in Section |I| this error depends on some 
function of the distances between the point and the nearest active nodes. This function may be 
different for different physical conditions and is sometimes known (as discussed in [18|) but, 
most often, is unknown before network deployment. For clarity of presentation, we describe our 
work using the case in which the error in the representation of a point may be assumed to be 
directly proportional to the distance between the point and its nearest active sensor node (the error 
is zero if there is an active sensor node exactly at that point). However, the metrics of quality of 
representation that we develop can be readily adapted to other cases with different relationships 
between representation error at a point and the distances to the nearby active nodes. Further, the 
heuristic algorithm we present later in this report is also independent of this assumption. Also 
for clarity, we present this work assuming that the region covered by the sensor network is a 

2- dimensional plane. The metrics presented here and the algorithm can be readily adapted to the 

3- dimensional case. 

Thus, the average of representation errors at all points in the region, normalized by the desired 
spatial granularity of active nodes for the physical condition being sampled, is one aspect of the 
quality of representation of the region. However, as we will show later in this section, a low 
average representation error alone does not tell the whole story and that an even spread of these 
values is also an essential aspect of the quality of representation. In the following, we formalize 
and develop a rigorous definition of two metrics: the average representation error based on the 
normalized average of the distances of the points to their respective nearest active nodes, and 
the unevenness of representation error based on the distribution of these distances. 

A. Average Representation Error 

Let dp{v) denote the distance of node v from point p. Let np(G) denote the nearest node in 
G (the set of active nodes) from point p. Let dji{G) denote the average value of dp{np{G)) over 
all points p in region R. dji(G) may also be thought of as the expected value of dp{np{G)) for a 
random point p in the region. Intuitively, given the same area of the region of interest and the same 
number of active sensor nodes, the smaller the value of dji{G) the lower the representation error. 
However, as mentioned in Section |I[ the representation error should also depend on the desired 
spatial density of active nodes required for sampling of the physical phenomenon at the point 
(for example, particulate pollution may have to be sampled at a higher spatial granularity than 
temperature and so, the same average distances may not imply the same representation error). 
Different applications can tolerate different average distances between points and the nearest 
active nodes for the same quality of representation; therefore, without knowledge of the desired 
spatial density, z, the value of dj^{G) reveals little about the quality of representation achieved 
for an application. Therefore, an appropriate metric is one that uses the average distance, dji{G), 
normalized by the average distance in the best-case scenario at the desired spatial density. 

The best-case scenario occurs when the region of interest can be covered in a space-filling 
fashion by non-overlapping circular areas with an active sensor node at the center of each 
circular area. Note that such a scenario is not realistic and is used here only as a means to 
derive a normalization factor in the metric. Given a desired spatial density of z, the radius of 
these circular areas in the best-case scenario is given by r = 1/ y/zn (recall that z denotes the 
desired number of active nodes per unit area, the size of each circular area is vrr^, and therefore. 
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(a) One active node at the 
center of a unit square area. 



(b) An active node placed 
at tiie center of each of 
the four quarters of a unit 
square area. 



Fig. 1: An example to illustrate the average representation error as a metric; D(G,R) is the 
same in the two cases when the desired spatial density in (a) is 1 but in (b) is 4. 



z = l/vrr^). The expected distance from points in the region to the nearest sensor node in this 
case is given by: 

27rx 2r 2 



lo vrr^ 3 Sa/^vF 

Thus, the normalized expected distance of points to their respective nearest nodes is given by: 

Dispensing with the constant, 3-y7r/2, we define the average representation error, denoted by 

D{G,R), as: 



D{G,R)=dn{G)yG (1) 

Fig. [T] is illustrative of the average representation error as a metric. Consider a square region 
of interest of unit area. The average representation error in Fig. [la] when the desired spatial 
density is 1 is the same as the average representation error when the desired spatial density is 
4. 



B. Unevenness of Representation Error 

The field of economics has a long history of measuring inequality and a vast body of literature 
on the topic [T9l, pOj . For measuring the unevenness of representation error, we use a popular 
and well-accepted metric in economics, the Gini index, based on the relative mean difference 
between the quantities being compared (in our case, the quantities are distances of points to 
their respective nearest active sensor nodes). Consider m quantities, (71 < (72 < ■ ■ ■ < dm- The 
mean difference between these quantities is: 



m m 

^ = -^J2Y.\9i- 9j\ 



The relative mean difference is the mean difference divided by the mean, g. The Gini index is 
defined as one-half of the relative mean difference, i.e.. 



^ m m 

Gini index = — = I Oi — I 

2g 2^m2^^tl' " 
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Fig. 2: An example illustrating that improving U{G,R) does not necessarily improve D{G,R). 



Adapting the Gini index to the context of our problem poses one issue: the number of quantities 
we have is infinite because of the infinite number of points in any region of interest. Therefore, 
instead of using summations, we consider expected values in defining unevenness. Let p and 
q denote two arbitrary random points in the region of interest R. We define the unevenness of 
representation error, U{G,R), of graph G in the region R as: 

rj(^ o^ ^ E[\d,{n,{G)) - d.MGm 
^ ' ' 2dn{G) 
where p,qeR. The smaller the value of the above quantity, the better the spatial uniformity. 



C. One Metric or Two? 

For both metrics, D(G, R) and U (G, R), a smaller value of the metric implies better quality 
of representation. A legitimate question at this point is whether minimizing one also minimizes 
the other, i.e., whether we need both of the above two metrics or if one of the metrics above 
can serve as the sole metric for measuring the quality of representation. We answer this using 
two simple examples: one in which U{G, R) is minimized but D{G, R) is not; another in which 
the D{G,R) is minimized but U{G,R) is not. 

Does improving the evenness also improve the average? Fig. |2] considers two regions, each of 
unit square area but with different sensor node deployments for a desired spatial density of 1 (i.e., 
we wish to place exactly one sensor node in the region). In the first of the two regions, a sensor 
node is placed at the centroid of the area while in the other region, the node is placed at one 
of the corners of the square region. Note that both D(G,R) and U{G,R) are minimized in the 
former case for the desired spatial density of 1. Also note that D{G, R) is maximized in the latter 
case. We now claim that the unevenness of representation is equal in the two cases and also the 
best achievable. The unevenness of representation error depends on the normalized distribution 
of the distances from the points in the region to the one active sensor node. Dividing the square 
region of interest into four quarters as shown in the region on the left-hand side in Fig. |2} we 
note that this distribution for points within each of the quarters is identical to each other. Since 
the overall distribution of these distances in the full square region of unit area is composed of 
the identical distributions within each of the quarters, the unevenness of representation in the 
region of unit square area in the left-hand side region is the same as that within each of the 
quarters. Now, the sensor node deployment shown in the region on the right-hand side of Fig. [2] 
can be thought of as an enlargement of one of the quarters in the region on the left-hand side, 
and therefore, achieving the same degree of evenness. This example shows that U (G, R) can be 
the minimum possible when D{G, R) is the minimum or the maximum possible. This shows that 
achieving the lowest possible unevenness of representation error does not necessarily achieve 
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(a) A placement in which 
U (G, R) is minimized but 
D{G, R) is not. 



(b) A placement in which 
D{G,R) is minimized but 
U{G,R) is not. 



Fig. 3: An example illustrating that improving D(G,R) does not necessarily improve U{G,R). 



the best average representation error; in fact, sometimes it can even lead to the worst possible 
average representation error. 

Does improving the average also improve the evenness? Consider a toy example of a region 
composed of two one-dimensional regions arranged in the form of a sideways 'T' as shown in 
Fig. [3j Assume that the desired spatial density corresponds to placing two active sensor nodes 



in the region. Fig. 3a illustrates a placement in which U {G, R) is minimized. Fig. 3b shows 
another placement of the two sensor nodes in which D{G, R) is minimized but U{G, R) is not 
minimized. 

The two examples above show that improving the evenness does not necessarily improve the 
average representation error and that improving the average does not necessarily improve the 
evenness of representation error. Therefore, both metrics are essential to gaining insight into the 
quality of representation (though different metrics may rank differently in their importance to 
different applications). 

D. Lower Bounds on D{G, R) and U{G, R) 

For all of our subsequent analysis, it is insightful to have values of the lowest possible average 
representation error and the lowest possible unevenness of representation error. The following 
theorem proves these bounds. 

Theorem 2.1: The lower bound on D{G,R) is 2/(3y/n) ^ 0.376 and the lower bound on 
U{G,R) is 0.2. 

Proof: The lower bounds of both D{G,R) and U{G,R) are achieved when the sensing 
nodes are perfectly evenly distributed such that the region of interest is completely covered by 
non-overlapping circular areas of radius r with a sensing node at the center of each circle. Since 
each circle is identical to all others as far as the distances of all points to their nearest active 
nodes are concerned, D{G,R) and U{G,R) for each circular region are the same as D(G,R) 
and U{G, R) for the entire region. 

The lower bound on D(G,R) is the average distance between the center of a unit circle and 



the points within the circle. This is an easily-derived geometric result [21 1. 

Focusing now on U{G, R), note that the average distance between a point and the center of 
the circle is given by: 

^ r 271X , 2r 

d = / — -xdx = — (3) 
JO 7rr^ 6 
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Consider two random points within such a circle at distances x and y from the center of the 
circle. 

,^ri n r 2vra; r 2Txy , , , , 4r 

E[\x-y\\= — ^/ — -\x - y\dydx = (4) 
Using Eqns. Q and (|3]): 

■ 

The bounds derived above are achieved in a scenario where the region of interest can be 
perfectly covered by non-overlapping circles. The simplest case would be a circular area with 
one sensor node placed at the center. While possible, this is an unlikely scenario for regions 
covered by a sensor network and therefore, in Appendix [A], we consider regions of interest 
of arbitrary shape and prove upper bounds on the lower bounds of D(G,R) and U{G,R). 



Theorem A.l in Appendix \M shows that the lower bounds for regions of arbitrary shape are 



only slightly higher than those for the ideal case proved above. 

in. Related Work 

The problem of designating the mode of a sensor node as either active or sleeping is related 
(though not identical) to the 2-color instance of some versions of the distributed graph coloring 



problem p2[ , [|23|, in which each node takes on one of two colors with the goal to minimize 
the number of neighbors of the same color as itself. While the algorithms in this body of work 
will generally improve the spatial uniformity of active nodes, they do not consider the distances 
between the nodes in their computations and therefore, are limited in their application to the 
problem under consideration. We show this later in Section |V] by simulating the Flip algorithm. 



an adaptation of the algorithm in [24|, in which each node begins with randomly assigning 
itself one of two modes, and then, at random intervals of time, switches to the mode that best 
approximates the active node ratio in its neighborhood. 

Spatially uniform distribution based on distances is more explicitly considered in another body 



of work related to the problem of facility location [25 1, [26 1. The problem involves determination 
of the locations of facilities in an environment (such as emergency services in a city) given 
some constraints and an objective function. In the field of networking, related problems have 
been solved in the context of content distribution networks where one has to replicate resources 
in multiple locations (servers on a network) to boost performance by minimizing delay from 



users to the nearest resource or by achieving load balancing on the network [27[-[30[. A 
variety of techniques, including graph-theoretic approaches, heuristic algorithms and dynamic 
programming, have been employed in these works to arrive at a solution. Ko and Rubenstein 
developed the first distributed algorithm for the placement of replicated resources, best described 
as a solution to the distributed graph coloring problem, by having each node continually change 
its color in a greedy manner to maximize its own distance to a node of the same color [31]. 
This work, which considers the distance between two nodes as that along the communication 
path and not as the geographical distance, cannot be directly applied to the problem considered 
in this report. A further reason this body of work does not directly apply here is that they only 
consider the relationships between nodes and not between the nodes and the points in the region 
of interest. Another set of works consider a set of points in the region of interest as the targets. 
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Fig. 4: The ideal sensor node layout which yields the target distance function, H. 



where the goal is to cover and monitor each target point, as is discussed in (3T\. This set of 
work also does not serve the purpose of achieving good quality of representation because, in 
our case, all points in the region are equally significant targets. 

Points in the region of interest are most explicitly considered in the set of works that propose 
coverage algorithms for sensor networks based on assuming a sensing area for each node in area- 
sensing applications [j2|-[|8|. The goal is usually to ensure that each point in the region of interest 
is within the sensing area of at least k active sensor nodes. Distributed algorithms to achieve 
fc-coverage do not much improve the quality of representation although they do not specifically 
attempt it either. In Section |V| we will compare the quality of sensor node representation and 
the lifetime of a representative member of this class of algorithms with the one proposed in this 
report. 



IV. The EvenRep( J", L) Algorithm 



The design of a distributed algorithm for the problem stated in Section I-A requires that a node 
make an estimate of the quality of representation in its local area in comparison to the desired 
spatial density to reliably determine if it should sleep or go active. Note that even though the 
quality of representation is about the distances of points in the region to the nodes, only nodes 
and not the points can participate in this distributed algorithm and so, we have to use heuristics 
based on distances between nodes to achieve an improved quality of representation for the points. 
A node, therefore, needs to know the expected distances to the nearest active neighbors in a 
target distance function and compare these against the actual distances. Let denote a target 
distance function which specifies a mapping between k > 1 and the target distance between an 
active node and its A;-th nearest active neighbor. The origin of a target distance function, say J-', 
may be the expected distances between neighboring nodes in a given spatial distribution S, but 
targeting in the algorithm is not necessarily the same as targeting S. For the same reason, 
is not necessarily a mapping between k and the expected distance between a node and its k-th 
nearest active neighbor in the spatial distribution S. 

The target distance function, may depend on the environment and on the application. In our 
preliminary work, the EvenCover algorithm [33], we used a 2-dimensional plane with a non-ideal 
target distance function derived from a Poisson point process. In this work, we recognize that 
the Poisson distribution does not offer an ideal target distance function for improving the quality 
of representation. Given a 2-dimensional planar region of interest, it is known that the best 
representation is achieved when the region is tessellated in a space-filhng manner by hexagonal 
cells with a sensor node placed at the centroid of each cell [34j , p5| . This ideal layout is 
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01 


Initialization: 


02 


Turn on sense mode with probability Cz- 


03 


EvenRep(J-, L) Algorithm (executes in a loop): 


04 


do: 


05 


if node is active: 


06 




07 


else: 


08 




09 


Wait for a random length of time between and T 


10 


K -s— min( number of active neighbors, L ) 


11 


Compile list of K nearest active neighbors 


12 


tor I < k < K: 


13 


Xk -i— distance to fc-th nearest active neighbor 


14 




15 


if (0 > znXl): 


16 


if (Q - z7rX| > 0.5): 


17 


Set node to sleep mode 


18 


else: 


19 


Set node to sleep mode with 




probability {Q — znX]^) 


20 


else: 


21 


if (zTT^I - g > 0.5): 


22 


Set node to active mode 


23 


else: 


24 


Set node to active mode with 




probability {zirXj. — Q) 


25 


while true 



Fig. 5: The EvenRepCJ-", L) algorithm executed at each node. 



shown in Fig. |4] and we denote by H the target distance function based on this layout. For a 
3-dimensional region of interest, the ideal layout will be different and likely based on one of the 
space-filling tessellations of 3-dimensional space discussed in [36]. We propose a new algorithm 
which accepts any arbitrary target distance function, J^. Further, the algorithm presented here, 
EvenRepCJ-", L), allows a limit, L, on the number of active neighbors that a node will consider in 
its decision making process. The choice of L in our implementation of the algorithm is discussed 
in Section HV-Cl 

A. Rationale and pseudo-code 

As before, let denote the total number of nodes in the region, A the area of the region 
and z the desired spatial density. Let C denote the active node ratio, the fraction of nodes in 
the region of interest that are active. Note that z is a property of the application and not of the 
sensor network used by the application, while C describes the state of the sensor network. Let 
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Cz = zA/N denote the desired active node ratio. Denote by Tk{J-') the distance between the 
current node to its fc-th nearest active node in the target distance function J^. 

The pseudo-code for the algorithm executed at each node is shown in Fig. [5| Each node can 
compute a quantity Q at the point where it is located based on a comparison between the actual 
distances to its nearest neighbors and their target values. Let K denote the minimum of L and the 
number of active neighbors within the node's communication radius. Let denote the distance 
between the node and its k-th nearest active neighbor. Then, in the case in which the target 
distances are exactly achieved, Q should equal the actual number of active neighbors within a 
radius of Xk- The expected number of nodes within radius Xk at the desired active node ratio 
is zttX]^. a node should stay in its current mode or switch to a different mode depending on 
whether or not the action taken helps bring the Q computed by it closer to zirXj^. For example, 
if the Q computed is lower than that implied in the active node ratio, the node should go into 
the sense mode if not already in the sense mode. 

Let Qi denote the Q computed by node i. The algorithm computes Qi as: 

Q^ = S^ + f:'^ (5) 

k=l 

where j is the distance from node i to its A;-th nearest neighbor, Ki is the minimum of L and 
the number of active neighbors of node i, and 5i is given by: 

f 1, if node i is in sense (active) mode, 
Oi = { (6) 
[0, if node i is in sleep mode. 

Let Ti denote the distance X^- j. If Qi computed as above exceeds z-nr'f by 0.5 or more and 
the node is in active mode, turning it to the sleep mode will bring the local active node ratio 
closer to that corresponding to the desired spatial density. Note that when a node goes from 
active to sleep mode, the number of active nodes in the local region of radius reduces by 1 
and, therefore, comparing the difference between Qi and zurf against 0.5 allows the node to 
best decide if it should sense or sleep so that Qi is as close to zur'j as possible after it makes 
the decision. Similarly, if zTxrf exceeds Qi by 0.5 or more and node i is in sleep mode, turning 
it to the active mode will also bring the local active node ratio closer to that corresponding to 
the desired spatial density. If Qi exceeds zTxrf by less than 0.5, the algorithm sets the node to 
sleep mode with probability equal to Qi — zTxrf (this does not necessarily bring the local spatial 
density closer to the desired value but is an attempt to bring the overall active node ratio of the 
network closer to that corresponding to the desired spatial density). Similarly, if ziirf exceeds 
Qi by less than 0.5, the node is set to active mode with probability z-nrf — Qi. 



B. Complexity analysis 

We now examine the computational, communication and memory complexity of the EvenRep(J-', L) 
algorithm separated from any topology control algorithm running at each node and entrusted 
with maintaining a list of active neighbors within the node's communication radius. Note that 
the execution of line 11 has computational complexity 0(1) because each sensor node has to 
compile K < L active neighbors where L is a small constant. During the execution of lines 
12-14, the node updates Q based on the distances between itself and up to K neighbors, each 
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of which takes 0(1) time. Again, since K < L, a small constant, this too adds a complexity of 
only 0(1). The rest of the algorithm is also 0(1) and therefore, the computational complexity 
of the algorithm at each node is 0(1). The communication complexity of EvenRep(J^, L) is also 
0(1), assuming again that a separate and independent topology control algorithm maintains a 
list of active neighbors. The memory usage of the algorithm is in the order of 0(1). 

Note that the topology control algorithm, running independently of EvenRep(J^, L), may have 
its own communication complexity associated with each node having to broadcast its active state 
to its neighbors, receiving an acknowledgement for it from each of its active neighbors and also 
receiving such state from each of its active neighbors. This communication complexity depends 
on the topology control algorithm used (which can vary greatly depending on the algorithm p7|- 



[39 1) and also on method used to estimate the distance to each active neighbor (such as whether 
it is based on assuming GPS devices in the sensor nodes [13| or on exchanging transmission 
and reception powers [[14l)- 



C. EvenRep(H, L) 

In the following, we focus on = /J as the target distance function and compute Tk{H) 
for use in the EvenRep(if, L) algorithm. In the perfect layout (shown in Fig. |4]), upon which 
the target distance function H is based, each active node has six equidistant active neighbors. 
Assuming an arbitrary distribution of sensor nodes in a region with a given node density z, 
the expected radius of a circular region that contains seven active nodes is r = ^^1 jiixz). To 
achieve a correspondence to a perfectly uniform distribution such as in Fig. |4} the heuristic 
EvenRep(iJ, L) uses a target distance of ^J^J{^lz) from an active node to each of its six active 
neighbors. 

Coincidentally and conveniently, for most topology control algorithms, the number of active 
neighbors is typically 6 or smaller p7|-p9|. Therefore, we provide here the target distance 
function only for A; < 6: 

Tk{H) = \ —, if A:<6 (7) 

The EvenRep(if, L) algorithm analyzed in this report uses the above expression for the target 
distance function H, and with L set equal to 3. This choice of L is based on our finding that 
the fourth neighbor and beyond (i) have a diminishing influence on the quality of representation 
within the local region of the node, and (ii) are better and more effectively considered at other 
nodes for which they are one of the three closest neighbors. 



V. Simulation Results 

Our simulation experiments use 1000 sensor nodes located in a square region of unit area with 
a spatial distribution given by a Poisson point process (each point in the region is equally likely 
to have a node). In our implementation of EvenRep(if, 3), we choose T as equal to 10 units of 
time (recall from line 9 of the pseudo-code in Fig. [5] that each node waits a random length of 
time between and T between making the sense/sleep decisions). The desired spatial density, 
the corresponding active node ratio and the communication radius used in the experiments are 
described as we discuss each of the simulation experiments in the following subsections. 
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(a) Average representation error, D{G,R), achieved by 
EvenRep(i/, 3) plotted against time from to lOT for different 
values of the communication radius. 
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(b) Unevenness of representation error, U{G,R), achieved by 
EvenRep(_ff, 3) plotted against time from to lOT for different 
values of the communication radius. 



Fig. 6: Plots showing the convergence of EvenRep(/7, 3) as the algorithm executes and improves 
the quality of representation. In these experiments, the desired spatial density used corresponds 
to an active node ratio of 0.35. 



Each data point reported in the figures in this section represents an average of 200 different 
simulation experiments (each using a different initial layout of the nodes). Based on the method 
of batch means to estimate confidence intervals, we have determined that the 95% confidence 
interval is within ±1% for each of the data points reported in the graphs. 

Our algorithm begins with each node randomly setting itself to active mode with probability 
equal to C^, the expected active node ratio when the desired spatial density is z. Thus, the 
spatial distribution of active nodes at the beginning of the simulation is given by a finite Poisson 
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(a) Average representation error, D{G,R), achieved by 
EvenRep(i/, 3) plotted against time from to lOT for different 
values of the desired active node ratio. 
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(b) Unevenness of representation error, U{G,R), achieved by 
EvenRep(_ff, 3) plotted against time from to lOT for different 
values of the desired active node ratio. 

Fig. 7: Plots showing the convergence of EvenRep(if, 3) as the algorithm executes and improves 
the quality of representation. In these experiments, the communication radius used is 0.08. 



point process. To understand the reference point at which the simulation begins, we prove an 
additional set of results in Appendix |B] on the two metrics. Theorem B.l in Appendix |B] proves 
that when the active nodes are located in the region with a spatial distribution given by a Poisson 
point process, the expected value of the average representation error, D{G,R), is 0.5 and the 
expected value of the unevenness of representation error, U{G,R), is 1 — l/\/2 f» 0.293. Due 
to the use of a finite Poisson point process for the initial layout of the nodes in the unit area 
in our simulation experiments, border effects cause the initial values of D(G, R) a nd U{G,R) 



to be shghtly larger than 0.5 and 1 — 1/a/2, respectively. Recall from Theorem 2.1 that a lower 
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(a) Average representation error, D{G, R), achieved after time 5T 
by different algorithms plotted against the communication radius. 
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(b) Unevenness of representation error, U{G,R), achieved after 
time 5T by different algorithms plotted against the communication 
radius. 

Fig. 8: Plots comparing the quality of representation achieved by different algorithms. 



bound on D{G,R) is 2/(3A/7r) ^ 0.376 and a lower bound on U{G,R) is 0.2. Therefore, in 
our simulation experiments, one should expect that D{G,R) reduces to something between 0.5 
and 0.376 and that U{G,R) reduces to something between 0.293 and 0.2 after a certain length 
of time since the beginning of algorithm execution. 

A. Convergence 

Fig. [6] shows the convergence of the EvenRep(if, 3) algorithm on the quality of representation 
metrics, D(G, R) and U {G, R), for different values of the communication radius while the desired 
spatial density corresponds to an active node ratio equal to 0.35 (i.e., given 1000 nodes in the 
unit area in our simulations, the desired spatial density, z, corresponds to 350 active nodes). 
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Almost all topology control algorithms achieve an average communication radius at each node 
corresponding to six or fewer neighbors [ |38| , [39|. Therefore, the largest communication radius 
we use is 0.08 units, which corresponds to approximately 6 active neighbors within a node's 
communication radius. 

Fig. I6a| plots the average representation error as the algorithm continues to execute for a 



length of time equal to lOT. Fig. 6b shows the corresponding convergence of the EvenRep(iJ, 3) 
algorithm on the unevenness of representation error metric using the same set of parameters. 
Note that both the average and the unevenness of representation error reduce rapidly as early as 
T (about 2 iterations of the loop between lines 04-25 in Fig. [5] because the expected length of 
time between two iterations is T/2). It is not necessarily true that as the communication radius 
increases, the average representation error, D{G,R), will reduce. This is because, when the 
communication radius is small, in order to achieve the desired active node density more nodes 
than necessary will determine that they should be active since they have limited information about 
the status of other nodes in the region. This reduces the average distance between active nodes, 
thus reducing D(G, R), but does not necessarily improve U{G, R). As one might expect, as the 
communication radius increases, each sensor node has more neighbors and is able to collect 
significantly more relevant information about the quality of representation in its neighborhood, 
thus reducing the unevenness of representation error. 

In our second set of experiments on the convergence properties of EvenRep(if, 3), we keep 
the communication radius constant and vary the desired active node ratio from 0.15 to 0.8. 
We use a communication radius of 0.08 units corresponding to an average of about six active 
neighbors within the radius when the active node ratio is 0.35 (the density used in the previous 



set of experiments). The result is plotted in Fig. |7a] and Fig. |7b] for an interval of time up to 
lOT. Once again, the algorithm appears to converge rapidly within time T, which corresponds to 
approximately 2 executions of the algorithm for each sensor node. Note that the algorithm's per- 
formance does not increase with the increase of the active node ratio. In this set of experiments, 
the algorithm achieves its best performance when the active node ratio is 0.15, the smallest 
ratio used in the experiments. This is expected because, when the active node ratio is high, the 
algorithm has fewer choices in determining the active node layout and becomes more confined 
to the original Poisson distribution of the nodes; on the other hand, when the active node ratio is 
low, the algorithm has more choices in determining which nodes should sense and which should 
sleep, leading to an improved quality of representation. A theoretical proof of the convergence 
of the algorithm remains an open problem. 

It should be noted that, while fast convergence to low values of these metrics is desirable, 
convergence to one particular layout of the active nodes is not desirable. This is because the 
lifetime of a network suffers if nodes, once chosen to be active, remain active forever. 



B. Comparative analysis 

We report results for the following distributed algorithms: 

• The EvenRep(i7, 3) algorithm: The EvenRep(J^, L) algorithm using a target distance func- 
tion F = H (based on a node placement in which the region is covered by non-overlapping 
hexagonal cells with each sensor node covering one cell) and L = 3. The value of Tk{H) 
used in the algorithm implementation is discussed in Section [lV-C| (see Eqn. (|7])). 
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(a) The fraction of nodes alive as the execution of an algorithm 
proceeds. 
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(b) The quality of representation as the execution of an algorithm 
proceeds. 

Fig. 9: Plots showing the network hfetime (the fraction of nodes alive is used as in indication 
of network lifetime) and the degradation in the quality of representation as nodes die. The 
communication radius used is 0.08 units. 



The EvenCover algorithm: This is our preliminary work p3| that employs a target distance 
function derived from a distribution of sensor nodes given by a Poisson point process. 
Sponsored Cover: For a representative coverage protocol that assumes an area-sensing 
application, we choose the well-cited coverage-preserving node- scheduling scheme based 
on sponsored coverage calculations [ [40| . In this protocol, a node decides to go into the 
sleep mode if its entire designated sensing area is also covered by its neighbors. To avoid 
situations in which each of two neighbors expects a certain spot to be covered by the other, 
the protocol implements a random time for which each node delays its decision. 
Flip: In this protocol, each node counts the fraction of its neighbors (including itself) that are 
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active and sets itself into either the sleep mode or the active mode depending on whether or 
not this fraction is larger or smaller than the desired active node ratio. We use this algorithm 
as representative of coverage strategies based on distributed algorithms for graph coloring 
that do not use the distance between the nodes in their computations [ j23| . 



Figs. 8a and 8b plot the performance of these algorithms against the communication radius 
after the algorithms execute for a period of time 5T. The Sponsored Cover algorithm takes the 
communication radius as the sole input while the other three also require the desired active 
node ratio as an input (chosen as 0.35 in these figures). The figures show that EvenRep(iJ, 3) 
achieves the best quality of representation among the algorithms studied. Given that the target 
distance function H is based on a more spatially uniform distribution of nodes than a Poisson 
point process (upon which the EvenCover algorithm is based), one expects the EvenRep(/f, 3) 
algorithm to perform better than EvenCover (as is also observed in these results). The Sponsored 
Cover algorithm seeks full, but not necessarily even, coverage of points in the region. As a 
result, in parts of the region with a denser cluster of nodes, the Sponsored Cover algorithm will 
unnecessarily turn on larger numbers of nodes resulting in poorer spatial uniformity. The Flip 
algorithm, on the other hand, does not consider distances between nodes and therefore, is far 
from being able to achieve a good quality of representation. 

C. Network Lifetime 

In this section, we compare the network lifetime of EvenRep(iJ, 3) with other algorithms. 
In our simulation experiments, we begin with each node allocated a certain amount of energy 
which is expended in the following two ways: 

• On/off broadcast transmission, used to broadcast the node's new status (on/off) to its 
neighbors. 

• Reception, for receiving data and control information from neighbors. 



The power consumption model is adapted from [40 1, with the assumption that each node is 
allocated 0.05J of energy and the data signal is a 2000-bit report message. The transmission 
energy consumption and the reception energy consumption is calculated as follows: 

ET^d) = Eelec X k + 6 f riss-amp X k X (f (8) 

Er^ = E,i,,xk (9) 

where Er^^d) is the energy consumed in transmitting the signal to an area of radius d, E^iec 
is the energy consumed for the radio electronics, e f riss-amp is for the power amplifier and Er^ 
is the energy consumed in receiving the signal. Radio parameters are set as E^iec = SOnJ/bit 
and Ef riss-amp = 10]9 J/bit/m^. The energy consumed in sensing is not considered in this set of 



simulations, exactly as in the model used in [40 1. 



The EvenRep(J-', L) algorithm does not use the concept of rounds while Sponsored Cover and 
Flip do. In order to present a fair comparison, therefore, we use a slightly modified version of 
our algorithm so that each node is scheduled to work in rounds. At the beginning of each round, 
each node randomly picks a start time, t, between and T, and makes its decision based on the 
information received so far. At time t after the start of the round, it switches its state according 
to the decision made. If the node chooses to turn on, then it will broadcast its decision to its 
neighbors. Each of its neighbors will receive the decision and remember it for its own reference. 
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If the node chooses to turn off, it will not make any broadcast attempts. A node is considered 
dead if it has consumed all the power allocated and alive, otherwise. Once a node is dead, it 
cannot be turned on again nor can it broadcast or receive signals. 

A fair lifetime comparison can only be achieved when both the communication radius and the 
active node ratio are the same for each of the four listed algorithms. Recall that the Sponsored 
Cover algorithm does not require the active node ratio as an input. However, since the active 
node ratio achieved by the Sponsored Cover algorithm is a function of the communication 
radius employed, it strictly limits the choice of the desired active node ratio we can use in our 
comparisons. In these experiments, we use a communication radius of 0.08 units for all four 
algorithms. Using this communication radius, we first note the active node ratio achieved by the 
Sponsored Cover algorithm and then use this ratio as the input active node ratio for the other 
three algorithms. 

Fig. [9a| reports the fraction of nodes alive as time progresses to indicate the network lifetime 
(for example, one may define network lifetime as the time until 50% of the nodes are dead). 
Figs. 9a and 9b demonstrate that the EvenRep(iJ, 3) algorithm significantly improves the lifetime 
of the network while also achieving a better quality of representation even as nodes die. Since 
the Sponsored Cover algorithm seeks area-sensing coverage of points in the region, some sensor 
nodes may have to constantly stay active while its neighbors are constantly in sleep mode. 
The goal of full coverage, therefore, contributes to a reduced lifetime while the goal of spatial 
uniformity, appropriate for spot-sensing applications, results in an improved lifetime. The combi- 
nation of striving for spatial uniformity and the use of random chance to place nodes in specific 
modes (as in lines 19 and 24 in Fig. |5]) leads to the difference between the lifetime achieved 
by EvenRep(i7, 3) and that achieved by other algorithms. In addition to improved lifetime. 
Fig. [9b] shows that the EvenRep(iJ, 3) algorithm better preserves the average and the evenness 
of representation error as nodes in the network die, in comparison to other algorithms. This 
indicates that EvenRep(if , 3) achieves a more graceful degradation of the sensor network as the 
battery power in the nodes are exhausted. The EvenCover algorithm, on the other hand, does not 
achieve as good a lifetime. As can be observed from Figs. [SalandlSbl the EvenCover algorithm 
performs as well as EvenRep(i7, 3) when the communication radius is small (corresponding to 
4 or fewer active neighbors). However, a communication radius corresponding to an average 
of 6 active neighbors is a more realistic scenario generated by topology control algorithms and 
EvenRep(i/, 3) performs significantly better in this range of the communication radius. 



VI. Conclusion 

Recent research literature has largely focused on area-sensing applications where the goal is to 
get each point in the region A;-covered. In this report, we turn our attention to spot-sensing appli- 
cations and introduce a new problem with the goal of achieving a good quality of representation 
by activating the sensor nodes in such a way that active nodes are spatially uniformly distributed. 
To the best of the knowledge of the authors, this is the first work that specifically targets quality 
of representation of the points in the region for spot-sensing applications in sensor networks. 
A better quality of representation indicates a shorter normalized average distance between the 
points in the region of interest to their nearest active nodes, and an even distribution of these 
distances. We have developed two complementary metrics to capture the quality of representation 
achieved by a sensor node deployment and used these in our evaluations of different algorithms. 
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We have developed a generalized distributed algorithm called EvenRep(J^, L), which accepts 
two parameters: J^, a target distance function that maps A; > 1 to the desired distance from a 
node to the A;-th nearest active neighbor and L, the maximum number of active neighbors that 
a node will consider in its decision making. We implement an instance of this algorithm using 
a target distance function, H, based on a spatial distribution in which the 2-dimensional region 
is covered by non-overlapping hexagonal cells with a sensor at the centroid of each cell. The 
results show that EvenRep(iJ, 3) achieves a better quality of representation and, very importantly, 
a longer network lifetime than other related distributed algorithms. 

Algorithms designed for area-sensing applications use the sensing radius as the only input 
parameter to determine the sense/sleep status of nodes. A given sensing radius implies a specific 
target spatial density, and vice versa. If the target density is low, it implies a large sensing radius 
and therefore, for most coverage algorithms, a large communication radius and high energy 
costs. Coverage algorithms designed for area-sensing applications, therefore, cannot be adapted 
for spot-sensing applications, especially at lower values of the desired density of active nodes. 
The EvenRep(i7, 3) algorithm, however, works well for spot-sensing applications at all active 
node densities while also achieving a longer lifetime. As a result, the EvenRep(/7, 3) algorithm 
also allows a graceful degradation of the network as nodes die because it preserves the quality of 
representation at all active node densities. Admittedly, EvenRep(/7, 3) is a heuristic. Future work 
in this direction should seek a strong theoretical foundation upon which distributed algorithms 
for improved quality of representation may be based. 



Appendix A 

Given a region of interest, R, of arbitrary shape covered by a graph of active sensor nodes, 
G, let LB{D) denote the lower bound on D(G,R) and let LB(f/) denote the lower bound on 
U{G, R). The following theorem derives an upper bound on these lower bounds. 

Theorem A.l: Given a region of interest of arbitrary shape covered by active sensor nodes. 



LB{D) < (^g + ^ j V2V3 ^ 0.3772 (10) 
LB(f/) < 0.2038 (11) 



Proof: Given a 2-dimensional region of arbitrary shape, we know that perfectly circular 
regions covered by each sensor node (as assumed in the proof of Theorem|2]T) will not achieve 
a space-filling tessellation of the area. Thus, the lower bounds on the quality of representation 
metrics will be higher for regions of arbitrary shape than those in Theorem |2.1[ A 2-dimensional 
plane can achieve a regular symmetric tessellation with only three types of tiles: equilateral 
triangles, squares, or hexagons [41 J . When an active sensor node is placed at the centroid of 
each tile, hexagonal tiles, being closer to a circular shape, achieve better quality of representation 
than triangles or squares. A region of interest of any arbitrary shape can achieve a space-filling 
tessellation with an infinite number of hexagonal tiles, each of infinitesimal size. Therefore, the 
quality of representation achieved by infinite space-filling hexagonal tiles, as shown in Fig. |4} 
is the best coverage that can be guaranteed for regions with an unknown arbitrary shape. The 
metrics D{G, R) and U{G, R) for this scenario represents, respectively, the upper bounds on the 
lower bounds of D{G,R) and U{G,R) for regions of arbitrary shape. 
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Fig. 10: Geometric illustrations supporting the proof of Theorem A.l 



Let r denote the distance between each pair of neighboring active sensor nodes (placed at the 
centroid of each hexagonal tile). Consider each of the innermost triangles (made up of dashed 
hnes in Fig. |4]) with sensor nodes as vertices. Consider the nodes near the center of the region 
(i.e., not at the boundary); each node belongs to six triangles and therefore, each triangle can 
be said to hold 3 x (1/6) = 0.5 nodes. Thus, given z nodes per unit area, the number of these 
triangles that can cover a unit area is z/Q.b = 2z. Note that the area of each triangle is \/3r^/4. 
Since the area covered by 2z of these triangles is 1, we have 2z{V3r'^/4) = 1. Thus, 



(12) 



For any point within each hexagonal tile, the closest sensor node is the one at the centroid of 
the tile. Since all the hexagonal tiles are congruent and identical with respect to the sensor node 
within them, D(G,R) and U{G,R) for the region of interest is the same as the D(G,R) and 
U (G, R) for any one hexagonal tile. Consider one such hexagonal tile, shown by solid lines in 



Fig. 10a The hexagonal tile can be divided into twelve non-overlapping congruent right-angled 



triangles, one of which is shown shaded in Fig. 10a Since the triangles are all congruent and 
also identical with respect to the placement of the nearest sensor node, D(G,R) and U{G,R) 
for the hexagonal cell is the same as the D(G, R) and U (G, R) for each triangle. Consider one 



such triangle, shown in Fig. 10b 



10b Using elementary 



Denote by d the length of the shortest edge of the triangle in Fig. 
geometry, d = y/3r/Q. The triangular area can be divided into two parts: 

• Area A: the area in which the distance between any point in the region to the sensor node 
is no larger than y/Sd. 

• Area B: the area in which the distance between any point in the region to the sensor node 
is larger than y/Sd. 



The two parts of the triangular region are shown in Fig. 10b Thus, the expected distance between 
a point in the triangular area to the sensor node can be computed based on the expected distances 
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from points within each of these two parts: 



a = - -7^ h ' ' ^ 



6 Jo ^^rf2 Jo JV3d ^^ci2 

^ + ^ln3)rf (13) 



Given d = \/3r/Q and Eqns. (Ilj) and (12), we have: 



D{G,R) = rfv^= (^ + ^ln3)c/v^ 



^ + — In 3^ 
9 12 J ^ 

^ 0.3772 (14) 

We now proceed to derive U {G, R). Consider two random points whose distances to the sensor 
node are x and y. There are three cases where the locations of the two points may fall. 

• Case 1: Both points fall within Area A. 

• Case 2: One of the points falls within Area A and the other within Area B. 

• Case 3: Both points fall within Area B. 

We now consider each of the three cases. In the following, — y\]i denotes the expected 

value of \x — y\ weighted by the probability of Case i. 



Case 1: 



Jo Jo ^ ^^d2 x^^2 

y/3n^d 



45 

Case 2: 

The probability that a point falls in Area A is 



(15) 



v^"^ ^xdx 



and the probability that a point falls in area B is: 



'.in{e+f) ydydO 



2 

Define ka as: 



Jv^d ^^c^2 
^/?>d 
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Therefore, 

rkg i-Vsd^ ^"^xdx ydydO 



E4\x-y\]2 = 2 {y-x 

Jo JVSdJo 



JV3dJo ' ^(p ^d? 

-3 9 o 



Case 3: 

The probability that both points fall in Area B is: 



6 / sin(9+^) 



V3d 



Define A;^ as: 



h 

Therefore, 



ydydO \ 
VSd 



sm{r] + f ) 



/f /-^ ^dydrjdxdO 
E4\x-y\\3 = I I \x-y 



Md7o Md' '^(P^(P 



Since -E^[|a; — y|]3 is bounded above as follows: 



2 



= 2(1- 4^)^(2 -v^)d 
o 

Thus, the overall expected difference between distances to the sensor node i 

E[\x-y\] = E^[\x-y\\ + E^[\x-y\]^ + E^[\x-y\] 

+ 7rrf(-- + -v^+ -v^lnS) 



< 



45 '39 6 



+ 2(1- 4^)2(2 -y3)rf 
o 

Using the above inequality with Eqns. ([T]) and ( [T3| ), 

t/(G,i?) < 0.2038 
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Appendix B 

In this section of the appendix, we consider D{G, R) and U{G, R) when the spatial distribution 
of active nodes is given by a Poisson point process. 

Theorem B.l: If the spatial distribution of active nodes is given by a Poisson point process, 
the expected values of D(G,R) and U{G,R) are 0.5 and 1 — l/\/2, respectively. 

Proof: Consider active sensor nodes randomly distributed in the region of interest, R, given 
by a Poisson process of rate z active nodes per unit area. Therefore, the probability that we will 
have k nodes within some area S is given by: 

Pik,S)=^-^^^^ (18) 

In the following, we assume that the region of interest is large enough to ignore boundary issues. 
From Eqn. (fTSl), the probability that there are active nodes within a radius of r is given by: 

P{0,nr')= ^"''"Jf^^^ =e--' (19) 



Consider a ring of radius r of infinitesimal area equal to 2T[rdr. Using Eqn. (18) again and 
noting that 2Txrzdr — )• implies e"^'^^^'^'" — )• 1 — 2nrzdr, the probability that there is exactly one 
active node on this ring is given by: 

P(l, 27irdr) = ^ ~ 27rrz dr (20) 

Thus, the expected distance, between the node and its nearest neighbor is given by: 

POO 

E\Xi] = / rP{0,7rr^)P{l,2TTrdr) 
Jo 

2 

^g-z7rr 2'arzdr 



Using Eqn. ([T]), we get: 



2yfz 

D{G,R) = \ 



(21) 



We now derive the expected value of U{G,R). Let W denote a random variable indicating 
the distance of a random point from its nearest node. The probability density function of W is 
given by: 

Pwir) = 2'Kre 



-wr z 



Consider any two random points whose distances to their respective nearest nodes are x and y. 
Now, 

E[\x~y\] = / pw{x) pw{y)\x -y\dydx 
Jo Jo 

coo PX 

Pw{x) pw{y){x-y)dy dx 
Jo 

+ / Pw{x) pw{y){y - x) dy dx 
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Focusing first on the inner integrals and simplifying, we get the following two results: 



Pwiy){x - y)dy 



e ^'"^ 2-nyz{x - y)dy 



X 



-ziry 



dy 



\ Pw{y){y-x)dy = / e^'^'^y 2nyzdy{y - x)dy 



-zny- 



dy 



Define g{x) as follows: 
Since g{oo) 



1 1 

2 V z' 



9[x) 



-ziry 



-ziry' 



dy 



dy 



Using ( [22| ) and ( [23] ), E[\x — y\] may be expressed as: 

) 

Vw{x)[x 



1 1 



2 V -2 



2g{x)]dx 



2txxz 



Itz 



2g{x) 



dx 



'- + 2e-^-^ g{x)\^ 



Simplifying further using routine calculus, we get: 



E[\x-y\ 



dx 



dx 



(22) 



(23) 



(24) 



Given that the expected distance to the nearest node is 1/2^, using Eqn. (24) in the definition 

of U{G,R), we get: 



U{G,R) 



1(1 



2i V z 

^2^ z 
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